Auto-generating Bilingual Dictionaries: Results of the TIAD-2017 Shared Task Baseline Algorithm
Abstract
Inferring a bilingual dictionary L1>L3 given two bilingual dictionaries L1>L2 and L2>L3 is a non-trivial task, as shown by reports of large-scale, computationally heavy experiments published in recent years (Soderland et al., 2009; Shezaf and Rappoport, 2010). Early work on this problem (cf. Tanaka and Umemura (1994)) had already noted that the main obstacle to such inference is that polysemy is not isomorphic across languages: a lexical item that is monosemous in L1 often has a polysemous counterpart in L2. In this paper, we present a new set of experiments on automatically generating bilingual dictionaries from existing ones. The data used are a commercial set of bilingual dictionaries with a particular topology when viewed as a graph connecting source and target languages. We find that searching for cycles in this graph is an effective method for generating translation inferences, and we reflect on the impact of the source data's structure on these results and on directions for future research.
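The idea of cycle-based inference can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a simplified three-language setting in which only an L1>L2 and an L2>L3 dictionary are available, so any cycle containing an L1 word and an L3 word must pass through two distinct L2 pivot words. The function name `infer_by_cycles` and the toy English/Spanish/French entries are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

def infer_by_cycles(l1_l2, l2_l3):
    """Return L1-L3 candidate pairs that lie on a cycle a - b1 - c - b2 - a.

    l1_l2 and l2_l3 are iterables of (source, target) translation pairs.
    Simplifying assumption: a pair (a, c) is kept only if at least two
    distinct pivot words in L2 link a to c.
    """
    pivot_to_l3 = defaultdict(set)
    for b, c in l2_l3:
        pivot_to_l3[b].add(c)

    # For each candidate (a, c), collect the pivot words that connect them.
    support = defaultdict(set)
    for a, b in l1_l2:
        for c in pivot_to_l3.get(b, ()):
            support[(a, c)].add(b)

    # Two or more pivots means the pair closes a cycle in the graph.
    return {pair for pair, pivots in support.items() if len(pivots) >= 2}

# Hypothetical toy dictionaries (English > Spanish, Spanish > French).
L1_L2 = [("bank", "banco"), ("bank", "orilla"),
         ("shore", "orilla"), ("shore", "costa")]
L2_L3 = [("banco", "banque"), ("banco", "banc"),
         ("orilla", "rive"), ("costa", "rive"), ("costa", "littoral")]

inferred = infer_by_cycles(L1_L2, L2_L3)
```

In this toy data the polysemous pivot "banco" links "bank" to both "banque" and "banc", but each link rests on a single path, so neither is kept; only a pair reached through two independent pivots survives the filter.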
Similar resources
On multiword lexical units and their role in maritime dictionaries
Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...
An Investigation into Bilingual Dictionary Use: Do the Frequency of Use and Type of Dictionary Make a Difference in L2 Writing Performance?
Bilingual dictionary use in L2 writing test performance has recently been the subject of debate. Opinions differ according to how the trait is understood and whether the system favors the process-oriented or product-oriented views towards the assessment and writing skill. Given the need for more empirical support, this study is aimed at investigating the availability of bilingual dictionary use...
Auto-extracting Paraphrases of Letter-word Phrases in Live Texts
In this paper we discuss the automatic extraction of paraphrases of letter-word phrases in live Chinese texts. The paper first discusses the modes of conventional dictionaries and gives the principles for paraphrasing letter-word phrases; it then analyses examples of letter-word phrase paraphrases, gives their formalized denotations, and presents an auto-recogni...
Computational bilingual lexicography: automatic extraction of translation dictionaries
The paper describes a simple but very effective approach to extracting translation equivalents from parallel corpora. We briefly present the multilingual parallel corpus used in our experiments and then describe the pre-processing steps, a baseline iterative method, and the actual algorithm. The evaluation of the two algorithms is presented in some detail in terms of precision, recall and pro...
Dictionary Learning for Car License Plate Detection
Car license plate detection has always been a challenging task in the context of traffic control and traffic offenses. In this paper, the problem of detecting license plates in grayscale images taken in natural conditions is addressed. Our car plate database consists of images captured under difficult conditions, such as low image quality, distant cameras, and severe weather. The ...